Support dataframe to tsfile #706

ColinLeeo · 2026-01-14T14:34:26Z

def dataframe_to_tsfile(dataframe: pd.DataFrame,
                        file_path: str,
                        table_name: Optional[str] = None,
                        time_column: Optional[str] = None,
                        tag_column: Optional[list[str]] = None,
                        ):
    """
    Write a pandas DataFrame to a TsFile by inferring the table schema from the DataFrame.

    This function automatically infers the table schema based on the DataFrame's column
    names and data types, then writes the data to a TsFile.

    Parameters
    ----------
    dataframe : pd.DataFrame
        The pandas DataFrame to write to TsFile.
        - If a 'time' column (case-insensitive) exists, it will be used as the time column.
        - Otherwise, the DataFrame index will be used as timestamps.
        - All other columns will be treated as data columns.
    file_path : str
        Path to the TsFile to write. Will be created if it doesn't exist.
    table_name : Optional[str], default None
        Name of the table. If None, defaults to "table".
    time_column : Optional[str], default None
        Name of the time column. If None, will look for a column named 'time' (case-insensitive),
        or use the DataFrame index if no 'time' column is found.
    tag_column : Optional[list[str]], default None
        List of column names to be treated as TAG columns. All other columns will be FIELD columns.
        If None, all columns are treated as FIELD columns.

    Returns
    -------
    None

    Raises
    ------
    ValueError
        If the DataFrame is empty or has no data columns.
    """
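
The column-resolution rules described in the docstring can be sketched in plain Python. This is only an illustration of the documented behaviour, not the code in this PR: `resolve_columns` is a hypothetical helper, it works on a list of column names (for example `list(dataframe.columns)`) rather than on a DataFrame, and raising `ValueError` for an unknown `time_column` or tag name is an assumption the docstring does not state.

```python
from typing import Optional, Sequence


def resolve_columns(
    columns: Sequence[str],
    time_column: Optional[str] = None,
    tag_column: Optional[Sequence[str]] = None,
) -> tuple[Optional[str], list[str], list[str]]:
    """Hypothetical helper: split column names into (time, tags, fields).

    A returned time name of None means "fall back to the DataFrame index".
    """
    if time_column is not None:
        # Explicit time column. Rejecting an unknown name is an assumption.
        if time_column not in columns:
            raise ValueError(f"time column {time_column!r} not found")
        time_name = time_column
    else:
        # Case-insensitive lookup of a column called 'time'.
        time_name = next((c for c in columns if c.lower() == "time"), None)

    tags = list(tag_column or [])
    unknown = [t for t in tags if t not in columns]
    if unknown:
        # Rejecting unknown tag names is also an assumption.
        raise ValueError(f"tag columns not found: {unknown}")

    # Everything that is neither the time column nor a TAG is a FIELD.
    fields = [c for c in columns if c != time_name and c not in tags]
    if not tags and not fields:
        raise ValueError("DataFrame has no data columns")
    return time_name, tags, fields
```

For instance, `resolve_columns(["Time", "device", "value"], tag_column=["device"])` picks `"Time"` as the time column, `["device"]` as TAG columns and `["value"]` as FIELD columns.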

codecov-commenter · 2026-01-14T15:00:17Z

Codecov Report

❌ Patch coverage is 0% with 3 lines in your changes missing coverage. Please review.
✅ Project coverage is 61.80%. Comparing base (052ff6b) to head (e625b76).

Files with missing lines	Patch %	Lines
cpp/src/cwrapper/tsfile_cwrapper.cc	0.00%	3 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #706      +/-   ##
===========================================
- Coverage    61.80%   61.80%   -0.01%     
===========================================
  Files          709      709              
  Lines        40375    40378       +3     
  Branches      5686     5687       +1     
===========================================
  Hits         24954    24954              
- Misses       14723    14726       +3     
  Partials       698      698

python/tests/test_dataframe.py

jt2594838 · 2026-02-09T01:32:20Z

cpp/src/cwrapper/tsfile_cwrapper.cc

+        if (cur_schema->column_category == TIME) {
+            continue;
+        }


If you skip the time column, it will be missing when the file is loaded into IoTDB.

jt2594838 · 2026-02-09T01:35:25Z

python/tests/test_to_tsfile.py

+        df_read = to_dataframe(tsfile_path, table_name="test_table")
+        df_read = df_read.sort_values('time').reset_index(drop=True)
+        df_sorted = convert_to_nullable_types(df.sort_values('timestamp').reset_index(drop=True))
+
+        assert df_read.shape == (30, 3)
+        assert df_read["time"].equals(df_sorted["timestamp"])
+        assert df_read["device"].equals(df_sorted["device"])
+        assert df_read["value"].equals(df_sorted["value"])


The name of the time column should be preserved.
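
One way to make this requirement explicit in a round-trip test is to compare the column names that were written with the ones read back. The helper below is a hypothetical sketch, not part of this PR; it takes plain lists of names, such as `list(df.columns)` and `list(df_read.columns)`.

```python
def missing_after_round_trip(written_columns, read_columns):
    """Hypothetical helper: written column names absent after reading back."""
    read = set(read_columns)
    # Keep the written order so failures are easy to read.
    return [name for name in written_columns if name not in read]
```

With the names used in the test above (written as `timestamp`, read back as `time`), the helper reports `["timestamp"]`; a test that expects the name to survive would assert that the result is empty.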

ColinLeeo force-pushed the support_dataframe_to_tsfile branch from d6aa3e1 to 8c93df1 Compare January 14, 2026 14:37

ColinLeeo force-pushed the support_dataframe_to_tsfile branch from 8c93df1 to 46de126 Compare January 14, 2026 15:09

ColinLeeo requested a review from jt2594838 January 14, 2026 15:15

ColinLeeo force-pushed the support_dataframe_to_tsfile branch 2 times, most recently from 9d28d1f to 9e1edf5 Compare January 16, 2026 03:41

jt2594838 approved these changes Feb 5, 2026

python/tests/test_dataframe.py (outdated, resolved)

ColinLeeo added 5 commits February 9, 2026 04:07

support save dataframe to tsfile.

2a1ba30

fix sort data.

64e465a

tmp code.

e405cb3

tmp code.

18971c8

tmp code.

1b3c275

ColinLeeo force-pushed the support_dataframe_to_tsfile branch from 9e1edf5 to 1b3c275 Compare February 8, 2026 20:08

ColinLeeo added 2 commits February 9, 2026 09:07

tmp code.

0f91284

fix import error.

e625b76

jt2594838 reviewed Feb 9, 2026

3 participants
